Phishing websites continue to successfully exploit user vulnerabilities in household and enterprise settings. Existing anti-phishing tools lack the accuracy and generalizability needed to protect Internet users and organizations from the myriad of attacks encountered daily. Consequently, users often disregard these tools' warnings. In this study, using a design science approach, we propose a novel method for detecting phishing websites. By adopting a genre theoretic perspective, the proposed genre tree kernel method utilizes fraud cues that are associated with differences in purpose between legitimate and phishing websites, manifested through genre composition and design structure, resulting in enhanced anti-phishing capabilities. To evaluate the genre tree kernel method, a series of experiments were conducted on a testbed encompassing thousands of legitimate and phishing websites. The results revealed that the proposed method provided significantly better detection capabilities than state-of-the-art anti-phishing methods. An additional experiment demonstrated the effectiveness of the genre tree kernel technique in user settings; users utilizing the method were able to better identify and avoid phishing websites, and were consequently less likely to transact with them. Given the extensive monetary and social ramifications associated with phishing, the results have important implications for future anti-phishing strategies. More broadly, the results underscore the importance of considering intention/purpose as a critical dimension for automated credibility assessment: focusing not only on the "what" but rather on operationalizing the "why" into salient detection cues.
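To make the tree kernel idea concrete, the sketch below shows a minimal convolution-style kernel over labelled trees, the general family the genre tree kernel belongs to. It counts matching subtree fragments between two page-structure trees; the node labels and tree shapes are purely illustrative and do not reproduce the paper's actual genre features.

```python
# A labelled tree is (label, [children]); e.g. a page's block structure.
def tree_kernel(t1, t2):
    """Convolution-style tree kernel: sum, over all node pairs, of the
    number of matching subtree fragments rooted at those nodes."""
    def common(n1, n2):
        # Fragments match only when labels and child counts agree.
        if n1[0] != n2[0] or len(n1[1]) != len(n2[1]):
            return 0
        prod = 1
        for a, b in zip(n1[1], n2[1]):
            prod *= 1 + common(a, b)
        return prod
    def nodes(t):
        out = [t]
        for child in t[1]:
            out.extend(nodes(child))
        return out
    return sum(common(a, b) for a in nodes(t1) for b in nodes(t2))
```

A kernel value like this can be plugged directly into any kernel-based classifier, so structurally similar pages score high against one another while pages with different compositions score low.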
Effective search support is an important tool for helping individuals deal with the problem of information overload. This is particularly true in the field of nanotechnology, where information from patents, grants, and research papers is growing rapidly. Guided by cognitive fit and cognitive load theories, we develop an advanced Web-based system, Nano Mapper, to support users' search and analysis of nanotechnology developments. We perform controlled experiments to evaluate the functions of Nano Mapper. We examine users' search effectiveness, efficiency, and evaluations of system usefulness, ease of use, and satisfaction. Our results demonstrate that Nano Mapper enables more effective and efficient searching, and users consider it to be more useful and easier to use than the benchmark systems. Users are also more satisfied with Nano Mapper and have higher intention to use it in the future. User evaluations of the analysis functions are equally positive.
Business intelligence and analytics (BI&A) has emerged as an important area of study for both practitioners and researchers, reflecting the magnitude and impact of data-related problems to be solved in contemporary business organizations. This introduction to the MIS Quarterly Special Issue on Business Intelligence Research first provides a framework that identifies the evolution, applications, and emerging research areas of BI&A. BI&A 1.0, BI&A 2.0, and BI&A 3.0 are defined and described in terms of their key characteristics and capabilities. Current research in BI&A is analyzed and challenges and opportunities associated with BI&A research and education are identified. We also report a bibliometric study of critical BI&A publications, researchers, and research topics based on more than a decade of related academic and industry publications. Finally, the six articles that comprise this special issue are introduced and characterized in terms of the proposed BI&A research framework.
Increasing global connectivity makes emerging infectious diseases (EID) more threatening than ever before. Various information systems (IS) projects have been undertaken to enhance public health capacity for detecting EID in a timely manner and disseminating important public health information to concerned parties. While those initiatives seemed to offer promising solutions, public health researchers and practitioners raised concerns about their overall effectiveness. In this paper, we argue that the concerns about current public health IS projects are partially rooted in the lack of a comprehensive framework that captures the complexity of EID management to inform and evaluate the development of public health IS. We leverage the theory of loose coupling to analyze news coverage and contact tracing data from 479 patients associated with the severe acute respiratory syndrome (SARS) outbreak in Taiwan. From this analysis, we develop a framework for outbreak management. Our proposed framework identifies two types of causal circles—coupling and decoupling circles—between the central public health administration and the local capacity for detecting unusual patient cases. These two circles are triggered by important information-centric activities in public health practices and can have significant influence on the effectiveness of EID management. We derive seven design guidelines from the framework and our analysis of the SARS outbreak in Taiwan to inform the development of public health IS. We leverage the guidelines to evaluate current public health initiatives. By doing so, we identify limitations of existing public health IS, highlight the direction future development should consider, and discuss implications for research and public health policy.
Fake websites have become increasingly pervasive, generating billions of dollars in fraudulent revenue at the expense of unsuspecting Internet users. The design and appearance of these websites makes it difficult for users to manually identify them as fake. Automated detection systems have emerged as a mechanism for combating fake websites; however, most are fairly simplistic in the fraud cues and detection methods they employ. Consequently, existing systems are susceptible to the myriad of obfuscation tactics used by fraudsters, resulting in highly ineffective fake website detection performance. In light of these deficiencies, we propose the development of a new class of fake website detection systems that are based on statistical learning theory (SLT). Using a design science approach, a prototype system was developed to demonstrate the potential utility of this class of systems. We conducted a series of experiments, comparing the proposed system against several existing fake website detection systems on a test bed encompassing 900 websites. The results indicate that systems grounded in SLT can more accurately detect various categories of fake websites by utilizing richer sets of fraud cues in combination with problem-specific knowledge. Given the hefty cost exacted by fake websites, the results have important implications for e-commerce and online security.
Knowledge management is essential to modern organizations. Due to the information overload problem, managers face critical challenges in utilizing the data in their organizations. Although several automated tools have been applied, previous applications often treat knowledge items as independent and rely solely on their content, which may limit their analytical capabilities. This study focuses on the process of knowledge evolution and proposes to incorporate this perspective into knowledge management tasks. Using a patent classification task as an example, we represent knowledge evolution processes with patent citations and introduce a labeled citation graph kernel to classify patents under a kernel-based machine learning framework. In the experimental study, our proposed approach shows more than 30 percent improvement in classification accuracy compared to traditional content-based methods. The approach has the potential to improve existing patent management procedures. Moreover, this research lends strong support to considering knowledge evolution processes in other knowledge management tasks.
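As a rough illustration of how a labelled graph kernel compares two citation structures, the toy sketch below counts shared labelled walks between two small directed graphs. This is a generic walk-counting kernel, not the paper's actual labeled citation graph kernel, and the node labels are hypothetical stand-ins for patent classes.

```python
from collections import Counter

def walk_kernel(g1, g2, length=2):
    """Count shared labelled walks up to `length` edges between two graphs.
    A graph is (labels: dict node -> label, edges: list of (u, v))."""
    def walks(g, max_len):
        labels, edges = g
        adj = {}
        for u, v in edges:
            adj.setdefault(u, []).append(v)
        seqs = Counter()
        frontier = [(n, (labels[n],)) for n in labels]  # length-0 walks
        for _ in range(max_len):
            nxt = []
            for n, seq in frontier:
                seqs[seq] += 1
                for m in adj.get(n, []):
                    nxt.append((m, seq + (labels[m],)))  # extend by one edge
            frontier = nxt
        for n, seq in frontier:
            seqs[seq] += 1
        return seqs
    w1, w2 = walks(g1, length), walks(g2, length)
    # Kernel value: inner product of label-sequence count vectors.
    return sum(w1[s] * w2[s] for s in w1)
```

Two patents whose citation chains pass through the same sequence of classes thus score higher against each other than patents with divergent citation histories.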
Online reputation systems are intended to facilitate the propagation of word of mouth as a credibility scoring mechanism for improved trust in electronic marketplaces. However, they experience two problems attributable to anonymity abuse: easy identity changes and reputation manipulation. In this study, we propose the use of stylometric analysis to help identify online traders based on the writing style traces inherent in their posted feedback comments. We incorporated a rich stylistic feature set and developed the Writeprint technique for detection of anonymous trader identities. The technique and extended feature set were evaluated on a test bed encompassing thousands of feedback comments posted by 200 eBay traders. Experiments conducted to assess the scalability (number of traders) and robustness (against intentional obfuscation) of the proposed approach found it to significantly outperform benchmark stylometric techniques. The results indicate that the proposed method may help mitigate easy identity changes and reputation manipulation in electronic markets.
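The core stylometric intuition can be sketched in a few lines: represent each comment by a style profile and compare profiles. The version below uses only a character-frequency profile with cosine similarity, a deliberately tiny stand-in for the rich Writeprint feature set (which includes many more lexical, syntactic, and structural features); the sample comments are invented.

```python
import math
from collections import Counter

def style_vector(text):
    """A toy writeprint: normalised lowercase character-frequency profile."""
    counts = Counter(text.lower())
    total = sum(counts.values())
    return {ch: n / total for ch, n in counts.items()}

def cosine(u, v):
    """Cosine similarity between two sparse frequency vectors."""
    dot = sum(u.get(k, 0.0) * v.get(k, 0.0) for k in set(u) | set(v))
    nu = math.sqrt(sum(x * x for x in u.values()))
    nv = math.sqrt(sum(x * x for x in v.values()))
    return dot / (nu * nv)
```

In an identification setting, an unattributed comment would be assigned to the known trader whose profile it most resembles; obfuscation resistance then hinges on using features that are hard for an author to consciously suppress.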
Content analysis of computer-mediated communication (CMC) is important for evaluating the effectiveness of electronic communication in various organizational settings. CMC text analysis relies on systems capable of providing suitable navigation and knowledge discovery functionalities. However, existing CMC systems focus on structural features, with little support for features derived from message text. This deficiency is attributable to the informational richness and representational complexities associated with CMC text. In order to address this shortcoming, we propose a design framework for CMC text analysis systems. Grounded in systemic functional linguistic theory, the proposed framework advocates the development of systems capable of representing the rich array of information types inherent in CMC text. It also provides guidelines regarding the choice of features, feature selection, and visualization techniques that CMC text analysis systems should employ. The CyberGate system was developed as an instantiation of the design framework. CyberGate incorporates a rich feature set and complementary feature selection and visualization methods, including the writeprints and ink blots techniques. An application example was used to illustrate the system's ability to discern important patterns in CMC text. Furthermore, results from numerous experiments conducted in comparison with benchmark methods confirmed the viability of CyberGate's features and techniques. The results revealed that the CyberGate system and its underlying design framework can dramatically improve CMC text analysis capabilities over those provided by existing systems.
Information overload often hinders knowledge discovery on the Web. Existing tools lack analysis and visualization capabilities, and search engine displays often overwhelm users with irrelevant information. This research proposes a visual framework for knowledge discovery on the Web. The framework incorporates Web mining, clustering, and visualization techniques to support effective exploration of knowledge. Two new browsing methods were developed and applied to the business intelligence domain: Web community uses a genetic algorithm to organize Web sites into a tree format; knowledge map uses a multidimensional scaling algorithm to place Web sites as points on a screen. Experimental results show that knowledge map outperformed Kartoo, a commercial search engine with graphical display, in terms of effectiveness and efficiency. Web community was found to be more effective, efficient, and usable than a traditional result list. Our visual framework thus helps to alleviate information overload on the Web and offers practical implications for search engine developers.
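The knowledge map idea, placing items on a screen so that screen distance reflects dissimilarity, can be sketched with a simple stress-minimising multidimensional scaling loop. This is a generic iterative MDS variant under illustrative parameters, not the system's actual algorithm.

```python
import math
import random

def mds_layout(dissim, iters=500, lr=0.05, seed=0):
    """Iteratively move 2-D points so that on-screen distances approximate
    the given pairwise dissimilarity matrix (stress-minimising MDS)."""
    rng = random.Random(seed)
    n = len(dissim)
    pos = [[rng.random(), rng.random()] for _ in range(n)]
    for _ in range(iters):
        for i in range(n):
            for j in range(n):
                if i == j:
                    continue
                dx = pos[i][0] - pos[j][0]
                dy = pos[i][1] - pos[j][1]
                d = math.hypot(dx, dy) or 1e-9
                # Pull i toward j if too far apart, push away if too close.
                err = (d - dissim[i][j]) / d
                pos[i][0] -= lr * err * dx
                pos[i][1] -= lr * err * dy
    return pos
```

Given dissimilarities computed from, say, Web site content overlap, the resulting coordinates can be rendered directly as a clickable map, which is the essence of the knowledge map display.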
The volume of qualitative data (QD) available via the Internet is growing at an increasing pace and firms are anxious to extract and understand users' thought processes, wants and needs, attitudes, and purchase intentions contained therein. An information systems (IS) methodology to meaningfully analyze this vast resource of QD could provide useful information, knowledge, or wisdom firms could use for a number of purposes including new product development and quality improvement, target marketing, accurate 'user-focused' profiling, and future sales prediction. In this paper, we present an IS methodology for analysis of Internet-based QD consisting of three steps: elicitation; reduction through IS-facilitated selection, coding, and clustering; and visualization to provide at-a-glance understanding. Outcomes include information (relationships), knowledge (patterns), and wisdom (principles) explained through visualizations and drill-down capabilities. We first present the generic methodology and then discuss an example employing it to analyze free-form comments from potential consumers who viewed soon-to-be-released film trailers, illustrating how the methodology and tools can provide rich and meaningful affective, cognitive, contextual, and evaluative information, knowledge, and wisdom. The example revealed that qualitative data analysis (QDA) accurately reflected film popularity. QDA also provided a predictive measure of the relative magnitude of film popularity between the most popular film and the least popular one, based on actual first-week box office sales. The methodology and tools used in this preliminary study illustrate that value can be derived from analysis of Internet-based QD and suggest that further research in this area is warranted.
The Kohonen Self-Organizing Map (SOM) is an unsupervised learning technique for summarizing high-dimensional data so that similar inputs are, in general, mapped close to one another. When applied to textual data, SOM has been shown to be able to group together related concepts in a data collection and to present major topics within the collection with larger regions. This article presents research in which the authors sought to validate these properties of SOM, called the Proximity and Size Hypotheses, through a user evaluation study. Building upon their previous research in automatic concept generation and classification, they demonstrated that the Kohonen SOM was able to perform concept clustering effectively, based on its concept precision and recall scores as judged by human experts. They also demonstrated a positive relationship between the size of an SOM region and the number of documents contained in the region. They believe this research has established the Kohonen SOM algorithm as an intuitively appealing and promising neural-network-based textual classification technique for addressing part of the longstanding "information overload" problem.
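The Proximity Hypothesis, that similar inputs land on nearby map units, can be illustrated with a minimal pure-Python SOM. The grid size, learning schedule, and toy 2-D data below are all illustrative choices, not the study's experimental setup, which used high-dimensional textual data.

```python
import math
import random

def train_som(data, grid_w=4, grid_h=4, dim=2, epochs=200, seed=0):
    """Tiny Kohonen SOM: each grid cell holds a weight vector; the
    best-matching unit (BMU) and its neighbours move toward each input."""
    rng = random.Random(seed)
    weights = {(i, j): [rng.random() for _ in range(dim)]
               for i in range(grid_w) for j in range(grid_h)}
    for t in range(epochs):
        lr = 0.5 * (1 - t / epochs)                          # decaying learning rate
        radius = max(1.0, (grid_w / 2) * (1 - t / epochs))   # shrinking neighbourhood
        for x in data:
            bmu = bmu_of(weights, x)
            for u, w in weights.items():
                d = math.dist(u, bmu)
                if d <= radius:
                    h = math.exp(-d * d / (2 * radius * radius))
                    weights[u] = [wi + lr * h * (xi - wi) for wi, xi in zip(w, x)]
    return weights

def bmu_of(weights, x):
    """Grid coordinate of the unit whose weight vector is closest to x."""
    return min(weights, key=lambda u: sum((w - xi) ** 2
                                          for w, xi in zip(weights[u], x)))
```

After training on two well-separated clusters, inputs from the same cluster should share a BMU or land on adjacent units, while inputs from different clusters map to distant units, which is exactly the property the study set out to validate with human judges.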
Information retrieval using probabilistic techniques has attracted significant attention on the part of researchers in information and computer science over the past few decades. In the 1980s, knowledge-based techniques also made an impressive contribution to "intelligent" information retrieval and indexing. More recently, information science researchers have turned to other, newer artificial intelligence-based inductive learning techniques including neural networks, symbolic learning, and genetic algorithms. The newer techniques have provided great opportunities for researchers to experiment with diverse paradigms for effective information processing and retrieval. In this article we first provide an overview of these newer techniques and their usage in information science research. We then present in detail the algorithms we adopted for a hybrid system based on genetic algorithms and neural networks, called GANNET. GANNET performed concept (keyword) optimization for user-selected documents during information retrieval using genetic algorithms. It then used the optimized concepts to perform concept exploration in a large network of related concepts through the Hopfield net parallel relaxation procedure. Based on a test collection of about 3,000 articles from DIALOG and an automatically created thesaurus, and using Jaccard's score as a performance measure, our experiment showed that GANNET improved Jaccard's scores by about 50 percent and helped identify the underlying concepts (keywords) that best describe the user-selected documents.
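The genetic-algorithm half of this pipeline, evolving a keyword subset that best matches the user-selected documents under Jaccard's score, can be sketched as below. The candidate keywords, document sets, and GA parameters are invented for illustration; this is not GANNET's actual implementation, and the Hopfield-net exploration stage is omitted.

```python
import random

def ga_optimize(candidates, docs, pop=30, gens=60, seed=0):
    """Toy GA: evolve a keyword subset (bitstring over `candidates`) that
    maximises the average Jaccard score against the documents' keyword sets."""
    rng = random.Random(seed)

    def jaccard(a, b):
        return len(a & b) / len(a | b) if a | b else 0.0

    def fitness(bits):
        chosen = {k for k, bit in zip(candidates, bits) if bit}
        return sum(jaccard(chosen, d) for d in docs) / len(docs)

    popn = [[rng.randint(0, 1) for _ in candidates] for _ in range(pop)]
    for _ in range(gens):
        popn.sort(key=fitness, reverse=True)
        popn = popn[: pop // 2]                       # selection: keep top half
        while len(popn) < pop:
            p1, p2 = rng.sample(popn[: pop // 4], 2)  # parents from the elite
            cut = rng.randrange(1, len(candidates))
            child = p1[:cut] + p2[cut:]               # single-point crossover
            if rng.random() < 0.2:                    # occasional bit-flip mutation
                child[rng.randrange(len(candidates))] ^= 1
            popn.append(child)
    best = max(popn, key=fitness)
    return {k for k, bit in zip(candidates, best) if bit}, fitness(best)
```

The evolved keyword set would then seed the second stage, where spreading activation over a thesaurus network suggests related concepts the user did not select explicitly.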